A Signal-adaptive Local Cosine Transform for Source Separation by Time-frequency Masking
نویسندگان
چکیده
Time-frequency masking is often used for source separation of underdetermined audio mixtures. It depends on the fact that the sources can be represented disjointly in some transform domain. The focus of this paper is on demixing sources from instantaneous, two-channel mixtures by binary masking. We investigate trees of local cosine bases from which a suitable transform may be generated—the best basis is chosen by a computationally efficient algorithm and is adaptively selected to match the time-varying characterists of the signal. Our heuristically motivated cost function maximises the energy of the transform coefficients associated with each estimated source. Finally, we evaluate our proposed transform by comparing it against two well-known transforms: the shorttime Fourier transform and the modified discrete cosine transform. We assume that the mixing parameters are known. Our results show that in some cases, our method can give better results than these fixed-basis representations.
منابع مشابه
Oracle Evaluation of Flexible Adaptive Transforms for Underdetermined Audio Source Separation
We describe and apply a flexible, adaptive cosine packet transform to separate audio sources from instantaneous, underdetermined audio mixtures by time-frequency masking. Previously studied adaptive transform schemes have two main drawbacks: the signal can only be partitioned into dyadic intervals, and the profiles of the overlapping windows are often very short, thus tapering off very quickly....
متن کاملcentre for digital music An Adaptive Stereo Basis Method for Convolutive Blind Audio Source Separation
We consider the problem of convolutive blind source separation of stereo mixtures. This is often tackled using frequency-domain independent component analysis (FD-ICA), or time-frequency masking methods such as DUET. In these methods, the short-term Fourier transform (STFT) is used to transform the signal into the time-frequency domain. Instead of using a fixed time-frequency transform on each ...
متن کاملAdaptive Segmentation with Optimal Window Length Scheme using Fractal Dimension and Wavelet Transform
In many signal processing applications, such as EEG analysis, the non-stationary signal is often required to be segmented into small epochs. This is accomplished by drawing the boundaries of signal at time instances where its statistical characteristics, such as amplitude and/or frequency, change. In the proposed method, the original signal is initially decomposed into signals with different fr...
متن کاملAn Adaptive Segmentation Method Using Fractal Dimension and Wavelet Transform
In analyzing a signal, especially a non-stationary signal, it is often necessary the desired signal to be segmented into small epochs. Segmentation can be performed by splitting the signal at time instances where signal amplitude or frequency change. In this paper, the signal is initially decomposed into signals with different frequency bands using wavelet transform. Then, fractal dimension of ...
متن کاملExtension of Sparse, Adaptive Signal Decompositions to Semi-blind Audio Source Separation
We apply sparse, fast and flexible adaptive lapped orthogonal transforms to underdetermined audio source separation using the time-frequency masking framework. This normally requires the sources to overlap as little as possible in the time-frequency plane. In this work, we apply our adaptive transform schemes to the semiblind case, in which the mixing system is already known, but the sources ar...
متن کامل